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A NEW CLASS OF LOW CALORIE PROTEIN SWEETENERS 

Technical Field 

10 The invention relates to using proteins as 

substitutes for sugar in sweetening foods and beverages. 
In particular , it concerns single chain proteinaceous 
compounds which are much sweeter than sucrose, and which 
retain their sweetening capacity although subjected to 

15 denaturing conditions. 

Background of the Invention 

It is well known that certain proteinaceous 
compounds have the ability to substitute in a highly ef- 

20 fective manner for sugar in giving foods and beverages a 
sweet taste. The simplest of these examples is aspartame, 
which is a dipeptide derivative and currently on the 
market. However, two much more complex proteins, monellin 
and thaumatin have been isolated from plant sources . 

25 Thaumatin is isolated from Thaumatococcus danie 1 1 i i , a 
West African plant having triangular shaped fruit at 
ground level. The natural, protein product, thaumatin, has 
an av^eragie sweetness of 2500 times that of sucrose and has 
been marketed under the trademark Talin. The three 

30 dimensional structure of this protein has been studied and 
the results published by De Vos, A.M., et al, Proc Natl 
Acad Sci USA (1985) 82:1406-1409. 

The other protein, which is nonglycosylated, is 
isolated from "Serendipity Berries" of the West African 

35 Plant Dioscoreophyllum comminisii . The amino acid 
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sequence of monellin is known, and the three dimensional 
structure of this protein has been determined by Ogata, 
C* r et al, Nature (1987) 328:739-742. Monellin has been * 
characterized by Morris et al, J Biol Chem (1973) 248 : 534- 
5 539 , and by others; Cagan, Science (1973) 181; 32-35; Bohak ^ 
and Li r Biochim Biophys Acta (1976) 427 : 153-170; Hudson 
and Beeman, Biochem Biophys Res Comm (1976) 71:212-220; 
Van der Wei and Loeve, FEBS Lett (1973) 29:181-183; Frank 
and Zuber Hoppe-Seyler's Z Physiol Chem (1976) 357 :585- 

10 592; Morris and Cagan, Biochim Biophys Acta (1972) 

261:114-122. U.S. Patent 3,998,798 describes the prepara- 
tion of natural monellin . 

The known amino acid sequence of the A and B 
chains of natural monellin is shown in Figure 1. it is a 

15 two chain peptide, one "A" chain containing 45, and the 
other "B" chain, 50 amino acid residues. The three 

dimensional conformation of the protein, shown in Figure / 
2, is evidently essential for its activity because when 
native monellin is heated to 90°C at neutral pH or to 50°C 

20 at acidic pH and then cooled, the sweetness is destroyed. 

The three-dimensional conformation of the 
protein as recently determined (supra) is shown in 
Figure 2. A B chain containing 50 amino acids is 
intimately associated with the A chain of 45 amino acids 

25 in such a way that there are many inter-chain inter- 
actions. Heating of the protein, evidently dissociates 
the chains in such a way that they are incapable of 
reforming into the appropriate conformation. 

It has now been found that the conformation of 

30 this proteinaceous compound can be maintained by 

synthesizing portions of the B and A components on a 
single molecule. This constraint results in resistance to 
denaturation and ease of maintaining the three dimensional * 
conformation. 



35 
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Disclosure of the Invention 

The invention provides single chain forms of the 
sweet proteinaceous substance, monellin, which are capable 
of maintaining their three dimensional conformation under 
5 conditions which would ordinarily denature the native 
protein. These forms retain their sweet taste, which is 
substantially more intense than that of sucrose, even 
after heating to 100°C at acidic pH. Thus, the substance 
remains useful for sweetening of foods and beverages 

10 including carbonated beverages exposed to temperatures 
above ambient. 

In one aspect the invention is directed to a 
single chain proteinaceous compound having a sweetness 
property at least 50, preferably 100, and most preferably 

15 1000 times that of sucrose on a weight basis. The 

compound has the formula B-C-A wherein B is substantially 
equivalent to the residues 1-46 of the B chain portion of 
native monellin and is linked through the C-terminus to C. 
C is a covalent linker which is, in turn, linked to the N- 

20 terminus of a peptide substantially equivalent to residues 
6-45 of the A chain of native monellin. The covalent 
linker, C, must be of approximately the length of 3-10 
alpha-amino acid peptide, of sufficient hydrophility to 
reside on the outer side of the molecules, and physiologi- 

25 cally acceptable. If the covalent linker is itself a 

peptide the substance will be a single chain protein which 
can be prepared using automated synthetic methods or can 
be prepared from a synthetic gene: Rising/ recombinant 
techniques, or by other methods known in the art . 

30 As recombinant methods may also be used, the 

invention, in another aspect, includes DNA sequences en- 
coding the protein, expression vectors containing these 
sequences, microorganisms or other host cells transformed 
with these vectors, and methods to produce the protein 

35 using these materials. The invention also includes useful 
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intermediates in the synthesis of the compounds of the 
invention. 

Brief Description -of the Drawings 
5 Figure 1 shows the amino acid sequence of the A 

and B chains of the native protein. 

Figure 2 is a representation of the three 
dimensional conformation of the native protein. 

Figure 3 shows the amino acid sequence of a 
10 single chain protein of the invention, and the nucleotide 
sequence of a synthetic gene useful in synthesizing said 
single chain protein. 

Modes of Carrying Out the Invention 

15 The proteinaceous compounds of the invention, 

useful as sweeteners, consist essentially of a peptide 
portion substantially equivalent to the sequence of 
residues 1-45 of the B chain of native monellin linked 
through the C-terminus by a covalent linker to the 

20 * terminus of a peptide substantially equivalent to the 

sequence of residues 6-45 of A chain of native monellin. 
By "substantially equivalent" is meant a peptide which, in 
the context of the compounds of the invention, results in 
a substance having a sweetening power at least 50 times 

25 that of sucrose and which has at least 80% homology with 
the peptide represented by residues 1-46 of the B chain or 
to the peptide represented by residues 6-45 of the A 
chain. At least 90% homology is preferred especially 
including conservative substitutions. 

30 Homology is calculated by standard methods which 

involve aligning two sequences to be compared so that 
maximum matching occurs r and calculating the percentage of 
matches. Thus, in a particularly preferred embodiment, 
the substances of the invention comprise a peptide having 

35 the amino acid sequence of residues 1-46 of the native 
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monellin B chain linked (through the linker) to a peptide 
having the primary structure represented by residues 6-45 
of the native monellin A chain. Substantially equivalent 
substances to these include those wherein one or more of 
5 the residues of the native sequence is deleted, 

substituted for, or inserted by a different amino acid or 
acids . 

Preferred substitutions are those which are 
conservative, i.e., wherein a residue is replaced by 

10 another of the same general type. As is well understood, 
naturally occurring amino acids can be subclassif ied as 
acidic, basic, neutral and polar, or neutral and nonpolar. 
Furthermore, three of the encoded amino acids are 
aromatic. It is generally preferred that peptides differ- 

15 ing from the native sequence contain substitutions which 
are from the same group as that of the amino acid 
replaced. Thus, in general, the basic amino acids Lys, 
Arg, and His are interchangeable; the acidic amino acids 
aspartic and glutamic are interchangeable; the neutral 

20 polar amino acids Ser, Thr, Cys, Gin, and Asn are inter- 
changeable; the nonpolar aliphatic acids Gly, Ala, Val, 
lie, and Leu are interchangeable; and the aromatic amino 
acids Phe, Trpr and Thr are interchangeable. While 
proline is a nonpolar neutral amino acid, it represents 

25 difficulties because of its effects on conformation, and 
substitutions by or for proline are not preferred, except 
when the same or similar conformational results can be 
obtained. In addition, although classified in di f f erent 
categories, Ala, Gly, and Ser seem to be interchangeable , 

30 and Cys additionally fits into this group. Some substitu- 
tions by amino acids from different classes may be useful 
to modify sweet taste responses . 

It should further be noted that substitutions by 
amino acids which are not encoded by the gene may also be 

35 made. Alternative residues include, for example, the 
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omega amino acids of the formula H 2 N ( CH 2 ) n COOH wherein n 
is 2-6. These are neutral , nonpolar amino acids r as are 
sarcosine (Sar), t-butyl alanine (t-BuA), t-butyl glycine 
(t-BuG), N-methyl lie (N-Melle), and nor leucine (Nle) . 
5 Phenyl glycine, for example, can be substituted for Trp, 
Tyr or Phe an aromatic neutral amino acid; citrulline 
(Cit) and methionine sulfoxide (MSO) are polar but 
neutral, cyclohexyl alanine (Cha) is neutral and nonpolar, 
cysteic acid (Cya) is acidic, and ornithine (Orn) is 

10 basic* The conformation conferring properties of the 

proline residues may be retained if one or more of these 
is substituted by hydro xypro line (Hyp) • 

In addition, one or more of the peptide linkages 
in the B or A peptide-corresponding portions of the 

15 substances of the invention may be replaced with other 

types of linkages which retain the same general properties 
such as -CH 2 NH-, -CHjS-, -CH 2 CH 2 -, -CH=CH-, -C0CH 2 -, 
-CH(0H)CH 2 - r and -CH^SO- by methods known in the art. 
See, for example, Spatola, A,F. Vega Data (1983), 1(3), 

20 "Peptide Backbone Modifications"; Spatola, A.F., in 

"Chemistry and Biochemistry of Amino Acids, Peptides and 
Proteins n , B* Weinstein, ed. r Marcel Dekker r New York, p 
273 (1983); Morley, J . S * , Trans Pharm Sci (1980), pp 463- 
468* These substitutions may be made so long as the 

25 resulting "protein" is physiologically acceptable. 

It should be further noted that if the protein 
embodiments of the invention are produced recombinantly as 
intracellular proteins/ and N- teinninal methionine residue 
may be retained in the finished product. Cleavage of this 

30 N-terminal methionine to liberate the native sequence may 
or may not be complete . 

The covalent linkers of the invention are, 
generally, of a length equivalent to that generated by a 
peptide of 3-10 amino acid residues, preferably 6-8 amino 

35 acid residues, and must be physiologically acceptable. By 
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"physiologically acceptable" is meant non-toxic and free 
of undesirable side effects. The covalent linker should 
have a hydrophilicity and length which permits the 
covalent linker to reside on the external portion of the 
5 molecule and not to distort the native conformation* One 
preferred covalent peptide linking sequence is shown 
below: 



Tyr-Glu-Asn-Glu-Arg-Glu-Ile-Lys . 

10 

A particularly preferred group of covalent 
linkers represented by C comprises a peptide sequence 
containing 3-10, preferably 6-8 , of the formula 

15 A 1 - A2 - A3 - A4 -A5-A6 - A7 - A8 - A9 - Al 0 

wherein each A1-A10 may be an amino acid residue or may be 
absent, but at' least three of A1-A10 must be amino arc id * 
residues- In a particularly preferred group of 

20 embodiments, A9 and A10 are absent, A2, A4 and A6 are 

acidic amino acids; A5 and A8 are basic amino acids, A3 is 
a polar /neutral amino acid, and Al and A7 are nonpolar 
amino acids. In a particularly preferred set of 
embodiments, A9 and A10 are absent, Al is Tyr or Phe, A2 

25 is Glu or Asp, A3 is Asn or Gin, A4 is Glu or Asp, A5 is 
Arg, His, or Lys, A6 is Glu or Asp, A7 is lie, Val, or 
Leu, and A8 is Arg, Lys , or His . 

Unless otherwise noted, the amino acid residues 
whether encoded by the gene or as nonencoded analogs, are 

30 in the L-conf iguration . However, two of such residues, 
preferably one, may be substituted by the D-isomer. 

Alternative linkers include any covalent linking 
moiety which is of the appropriate length as described 
above, appropriate hydrophilicity, and physiological ac- 

35 ceptability. For example, polyethylene glycol oligomers 
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15 



of the formula HO(CX 2 CX 2 0) n H wherein each X is independ- 
ently H or a saturated or unsaturated hydrocarbyl of 1-4C 
and wherein n is 3-10 , preferably 6-8 , can also be used. 
Oxidation of one of the terminal hydroxyl groups provides 
a carboxyl for formation of the amide to the N-terminus of 
the A chain; linkage to the B chain is through an ester 
linkage with the remaining hydroxyl of the polyethylene 
glycol, A variety of covalent linkers of the correct 
f unctionality, hydrophilicity and length can be provided 
and used to effect covalent linkage of the C- terminus of 
the B chain with the N-terminus of the A chain. A variety 
of covalent linkers, for example,, is available com- 
mercially from Pierce Chemical Company. 

The B and A peptides of the invention compounds, 
and the C linking portion, if it, too, is a peptide, can 
be synthesized using standard automated peptide synthesis 
techniques, either manually or on automated synthesizers 
such as those supplied by Applied Biosystems (Foster City, 
California) r or Biosearch (San Rafael, California). 
Resins derivatized to the C-terminal amino acid in 
protected form are commercially available. Thus, for 
synthesis of the A chain portion, resins derivatized with 
proline are preferred; for the B portion, in a preferred 
25 embodiment, those derivatized to lie. The subsequent 
amino acids are added using standard techniques to 
synthesize peptides of the required length. The A and B 
chains can each be synthesized in to to or as fragments 
which are then ligated r and subsequently linked with the 
covalent linker to obtain the substances of the invention. 

If prepared using recombinant techniques, a DNA 
sequence encoding the desired peptide portion r in some 
cases the full-length peptide, is synthesized using 
standard automated techniques. This DNA is ligated into 
35 suitable expression vectors and these vectors are 

transformed into appropriate hosts . A variety of expres- 
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sion vector/host cell systems can be used, including both 
procaryotic and eucaryotic culture systems. 

Procaryotes most frequently are represented by 
various strains of E. coli . However, other microbial 
strains may also be used, such as bacilli, for example 
Bacillus subtilis , various species of Pseudomonas , or 
other bacterial strains. In such procaryotic systems, 
plasmid vectors which contain replication origins, and 
control sequences derived from a species compatible with 
the host are used. For example, E. coli is typically 
transformed using derivatives of pBR322, a plasmid derived 
from an E . coli species by Bolivar et al. Gene (1977) 
2:95 . Commonly used procaryotic control sequences, which 
are defined herein to include promoters for transcription 
initiation, optionally with an operator, along with 
ribosome binding site sequences, include such commonly 
used promoters as the beta-lactamase (penicillinase), 
lactose (lac) promoter systems (Chang et al, Nature (1977) 
19.8:1056), the tryptophan (trp) promoter system (Goeddel 
et al, Nucleic Acids Res (1980) 8:4057), and the lambda- 
derived P^ promoter and N-gene ribosome binding site 
(Shimatake et al. Nature (1981) 292:128). However, any 
available promoter system compatible with procaryotes can 
be used. 

The expression systems useful in the eucaryotic 
systems of the invention comprise promoters derived from 
appropriate eucaryotic genes . A class of promoters useful 
in yeast, for example, include promoters for synthesis of 
glycolytic enzymes, including alcohol dehydrogenase-! 
promoter, glyceraldehyde-3-phosphate dehydrogenase 
promoter (Holland & Holland, J Biol Chem (1980) 255:2596) , 
alpha-factor promoter (Bitter et al, Proc Natl Acad Sci 
(1984) £1:5330), the gal promoter (Johnston & Davis, Mol 
Cell Biol (1984) 4:1440) those f or 3-phosphoglycerate 
kinase (Hitzeman, et al, J Biol Chem (1980) 255:2073). 
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Other promoters include those from the enolase gene (Hol- 
lands M.J., et al r J Biol Chem (1981) 256; 1385) or the 
Leu2 gene obtained from YEpl3 (Broach, J, , et al, Gene 
( 1978) 8:121) . 

5 Suitable mammalian promoters include the early 

and late promoters from SV40 (Fiers et al, Nature » ( 1978) 
273 : 113) or other viral promoters such as those derived 
from polyoma, adenovirus II f bovine papilloma virus or 
avian sarcoma viruses. Suitable viral and mammalian 

10 enhancers are cited above* In the event plant cells are 
used as an expression system/ the nopaline synthesis . 
promoter is appropriate (Depicker, A., et al, J Mol Appl 
Gen (1982) 1:561). 

Depending on the host cell used, transformation 

15 is done using standard techniques appropriate to such 

cells. The calcium treatment employing calcium chloride, 
as described by Cohen, S.N. , Proc Natl Acad Sci USA (1972) 
69_x 21 10 r or the RbCl method described in Maniatis et al, 
Molecular Cloning; A Laboratory Manual (1982) Gold Spring 

20 Harbor Press, p. 254 is used for procaryotes or other 
cells which contain substantial cell wall barriers. 
Infection with Aqrobacterium tumefaciens (Shaw, C.H., et 
al. Gene (1983) 23:315) is used for certain plant cells. 
For mammalian cells without such cell walls, the calcium 

25 phosphate precipitation method of Graham and van der Eb, 
Virology (1978) 52:546 is preferred. Trans formations into 
yeast are carried out according to the method of Van 
Solingen, P- , et al, J Bacter (1977) 130*946; and Hsiao , 
C.L. r et al, Proc Natl Acad Sci USA (1979) 76:3829. 

30 In general, after construction of a suitable 

expression system, the system is trans fected into the ap- 
propriate host and successful transf ormants are selected 
by markers contained on the expression vectors. Success- 
fully transformed colonies are then cultured in order to 

35 produce the desired protein. It is preferred that a 
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promoter which can be controlled by regulating conditions 
in the environment be used so that the cells can be grown 
under conditions where the gene encoding the desired 
protein of the invention is not expressed, and then 
5 production of the protein induced by appropriate manipula- 
tion of conditions. For example, if the trp prompter is 
used in E. coli , the cells are grown in the presence of 
tryptophan and expression is then induced by diminution of 
tryptophan concentration or by addition of a tryptophan 
10 analog such as indolylacetic acid. If the gene is under 
control of the P L promoter, the cells are grown at 
relatively low temperature, such as at about 35°C, to a 
suitable cell density, and the temperature is then 
elevated to activate this promoter. If produced in bacte- 
15 rial hosts as a mature intracellular protein, the N- 

terminal methionine may or may not be cleaved. In mam- 
malian systems, for example, the use of the 
metallothionein promoter permits induction by addition of 
heavy metals or glucocorticoids.* This protocol is 
20 preferred to prevent premature accumulation of the protein 
which may be harmful to the growth of the cell. 

The protein may also be produced in secreted 
form by construction of vectors wherein the B peptide is 
preceded by a signal peptide workable in the appropriate 
25 host. 

The protein is recovered from the medium or from 
the cells using suitable techniques generally known in the 
art , and pxirif led by, for example, ion exchange .• ; • 

chromatography, ammonium sulfate precipitation, gel 

30 permeation chromatography, and so forth. The sweetening 
compounds of the invention may also be used to generate 
antibodies immunospecif ic for themselves which are useful 
in purification of the compounds from synthetic or 
bioculture media. These antibodies may be prepared using 

35 standard immunization techniques by mixing the compound 
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with suitable excipients and injecting the preparations 
into vertebrate hosts, typically rabbits or other mam- 
malian hosts. The antisera or purified antibodies can 
then be conjugated to solid supports to provide 
5 immunoaf f inity columns for purification of the desired 
materials . 

The proteinaceous substances of the invention of 
the formula B-C-A, however prepared, can be used as 
sweetening elements in foods and beverages. Since the 

10 substances of the invention have a sweetening capacity at 
least 100 times that of sucrose, small quantities can be 
used to supplement the flavoring of juices, carbonated 
beverages, and other soft drinks. The sweeteners can also 
be used as sugar substitutes in hot beverages such as cof- 

15 fee and tea since the conformation is maintained at 

elevated temperatures. These materials can also be used 
to sweeten animal feeds . 

For those embodiments of the invention compounds 
which are encoded by DNA sequences, the materials may be 

20 produced in situ by trans fee ted microorganisms, eucaryotic 
cell lines, or plants. For example, expression systems 
containing the gene encoding the desired substance can be 
transfected into the culture organisms used in production 
of yogurt, wine, beer and the like and the sweetener 

25 produced along with the production of the desired product. 
In addition, expression systems operable in plants includ- 
ing the gene encoding the invention compounds may be used 
to trans feet explahts or plant protoplasts , and these then 
regenerated into intact plants which are then genetically 

30 capable of production of sweeter forms of fruit or 
vegetable products . 

When used as a sweetener product, the compounds 
of the invention will be typically extended by addition of 
a liquid or powders wherein the compound of the invention 

35 constitutes about 0.1-99% by weight of the composition. 
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Suitable extenders include, for example, inert powders, 
such as cellulose and may include additional helpful 
ingredients such as antioxidants, preservatives, protease 
inhibitors r and so forth. 

Examples 

The following examples are intended to il- 
lustrate the invention but not to limit its scope. 



Example 1 
Preparation of the Synthetic Gene 
The protein of the amino acid sequence shown in 
Figure 3 is encoded by a DNA sequence as there shown. As 
shown in Figure 3, nucleotides 1-141 encode residues 1-45 
15 of the native B chain preceded by a met encoding ATG start 
codon, nucleotides 142-165 encode the linking "C" portion 
of 8 amino acids, and nucleotides 166-285 encode residues 
6-45 of the native A protein. 

This synthetic gene was prepared from the fol- 
lowing oligomers, synthesized using Applied Biosystems 
380B DNA Synthesizer. 
5' — > 3' 

U 1 : TATGGGAGAATGGGAAATTATCGATATTGGACCATTCACTCAAAAC ( 4 6mer ) 
U2 : TTGGGTAAGTTCGCTGTTGACGAAGAAAACAAGATTGGTCAATAT ( 4 5mer ) 
U 3 : GGTAGATTGACTTTCAACAAGGTTATTAGACC ATGTATGAAGAAG ( 4 5mer ) 
U 4 : AG TATTTACGAAAACGAAAGAGAAATTAAGGGGT AC GAATAC C AA ( 4 5mer ) 
U5 : TTGTATGTTTACGCTTCTGAC ( 45mer ) 

U6^ (45mer) 
U7 : CCAGTTCCACCACCATAATAG ( 2 lmer ) 
LI: CGATAATTTCCCATTCTCCCA (2 lmer) 
L2: CGTCAACAGCGAACTTACCCAAGTTTTGAGTGAATGGTCCAATAT (45mer) 
L 3 : CCTTGTTGAAAGTCAATCTACCATATTGACCAATCTTGTTTTCTT ( 4 5mer ) 
L4: CTCTTTCGTTTTCGTAAATAGTCTTCTTCATACATGGTCTAATAA (45mer) 
L5 : TGTCAGAAGCGTAAACATACAATTGGTATTCGTACCCCTTAATTT ( 4 5mer ) 
35 L6 : TACCGCGGGTCTTGTAGTCTTCAGAAATGTCAGCTCTGAAAAGCT ( 4 5mer ) 

L7: TCGACTATTATGGTGGTGGAACTGGACCGTTGAATCTCAACAACTTTC (4 8mer) 
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The oligomers were isolated by urea-polyacryl amide gel 
electrophoresis and purified by passing through a Sep-pak 
C18 column (Whatman) and annealed and ligated as shown: 

5 

Ndel Sail 
Ul U2 U3 U4 U5 U6 U7 



10 1*1 L2 L3 L4 L5 L6 L7 

to obtain the synthetic gene of Figure 3 bracketed by Ndel 
and Sail sites. 

For the ligation, each oligomer was 

15 phophorylated at 37°C for 45 min. in a reaction mixture of 
30 ul containing 50 mM Tris-HCl, pH 8.0 f 10 mM MgCl 2 , 10 
mM DTT, 1 mM ATP r and 5 units of T4 polynucleotide kinase. 
Each reaction mixture was pooled, extracted by phenol/ 
chloroform, precipitated with ethanol, and dried under 

20 Speed-Vac • The dried pellet was dissolved in 50 ul 

distilled water and 7 ul ligation buffer (0.2 M Tris-HCl, 
pH 7.5, 0.1 M MgCl 2r 0.1 M DTT) added. The solution was 
placed in a 95°C water-bath and cooled slowly to room 
temperature overnight* To the mixture was added 7 ul of 

25 10 mM ATP, 40 units of T4 DNA ligase (New England Biolab 
Inc.) and 2 ul of water. 

The reaction mixture was kept at room 
temperature for 10 miny , extracted by phenol /chloro form , 
precipitated, dried and redis solved in 85 ul water. The 

30 ligated oligomer mixture was treated with restriction 
endonuclease Ndel and Sail (New England Biolabs, Inc.), 
and the 290 base pair fragment was isolated by 
electrophoresis with a 7% polyacrylamide gel, the band 
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electroeluted and purified using the Elutip-D column (S&S 
Co.)* 

M13mpl9RF was used for cloning the synthetic 
monellin gene. M13mpl9RF was cut with Xbal/Sall (New 
5 England Biolabs, Inc.), and the large fragment was 

isolated and purified. A synthetic Xbal/Ndel adaptor, 

Xbal Ndel 
5' - CTAGAAACTGCAATGTTGAATAAACGCTGATTTTCGATCA - 3' (40mer) 
10 3' - TTTGACGTTACAACTTATTTGCGACTAAAAGCTAGTAT - 5' (38mer) 

was purified, and the Ndel/Sall digested, annealed 
synthetic monellin DNA fragment prepared above was 
combined with Xbal/Sall-treated M13mpl9RF and Xbal /Ndel 

15 adaptor in 10 ul of 20 mM Tris-HCl, pH 7.5, 10 mM MgCl 2 , 
10 mM DTT, 200 units T4 DNA ligase (New England Biolabs, 
Inc.) and incubated at 4°C overnight to provide M13mpl9 
MON-1RF. The ligation mixture was transformed into hosts 
by adding 5 ul of the ligation mixture to 200 ul of E^_ 

20 coli JM101 competent cells (Messing, J., Methods in 

Enzymoloqy (1983) 101 :20-78) , and the desired sequence was 
confirmed by dideoxy sequencing (Sanger, T\ , et al, Proc 
Natl Acad Sci USA (1985) 74:5463-5467). 

25 Example 2 

Preparation of An Expression Vector 
The 293 bp Ndel /Sail synthetic gene was isolated 
from M13mpl? MON-1 RF and purified. A cpmmerci.ally avail- 
able vector, pDR720, containing the trp promoter/operatbr 
30 (Pharmacia, Inc.; Cat. #27-4930-01) was digested with 

Smal/PvuII and blunt-end ligated to produce ptrp322. The 
ptrp322 was digested with Hpal/Sall and a 2.5 kb large 
fragment isolated. A synthetic Hpal/Ndel adaptor, 
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5 r - AACTAGTACGCAAGTTC^CGTAAAAAGGGTAATACA - 3' (36mer) 
3' - TTGATCATGCGTTCAAGTGCATTTTTCCCATTATGTAT - 5' (38mer) 
Hpal Ndel 

5 was synthesized using Applied Biosystems DNA Synthesizer 
Model 380B. The ligation reaction of the 29 3 bp synthetic 
monellin gene, Hpal/Sal I- treated ptrp322 vector and the 
Hpal/Ndel synthetic adaptor was carried out in the pres- 
ence of 10 ul of 20 mM Tris-HCl, pH 7*5, 10 mM MgCl 2 , 10 
10 mM DTT, and 200 units of T4 DNA ligase (N.E. Bio labs, 

Inc.) at 14°C overnight to give ptrp322H MON-1. The liga- 
tion mixture was transformed into E . coli W3110 {ATCC 
27325) and ampicillin- resistant clones were picked* 

15 Example 3 

Production of B-C-A 

An overnight culture of 50 ul of ptrp322H MON-1 

in W3110 with Luria Broth was inoculated into 5 ml of M9 

media containing 0*4% casamino acid, 10 ug/ml vitamin Bl, 

20 40 ug/ml ampicillin and cultured at 37°C in a temperature 

controlled-shaking incubator until 0D,_- rt reached about 

650nm 

0.5. Then 0.1 mg of indolyl acrylic acid was added to the 
reaction mixture to a concentration of 50 ug/ml and the 
mixture incubated further for about 8 hr. The cultured 

25 cells were pelleted at 2500 rpm for 5 min. in a Beckman J6 
centrifuge, Laemmli protein sample buffer was added to 
the cell pellet, followed by heating at 95°C for 5 min. 
and the protein was loaded onto 15% Laenutili SDS - 
polyacrylamide gel (Laemmli, Nature ( 1970) 227:680-685) . 

30 The electrophoresis was run at 300 for 2.5 hours. The gel 
was stained with Coomassie blue brilliant dye demonstrat- 
ing a product having the correct molecular weight. The 
expressed product was isolated and shown to have a sweet 
taste. 

35 
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The product was dissolved in water and heated to 
100°C for 5 minutes. The solution was cooled to room 
temperature and the solution retasted. The sweet taste 
remained after the heating and cooling cycle. 

5 

Example 4 
Purification of B-C-A 
The cultured cells were sonicated in a buffer 
containing 0.01 M sodium phosphate, pH 7.2, 7 mM beta- 

10 mercaptoethanol, 1 mM EDTA, 25 ug/ml PMSF protease 

inhibitor. The sample was spun at 10,000 rpm for 15 min 
in a Sorvall SS34 rotor at 4°C. The extract was diluted 
with the buffer. The sample was loaded on CM-25 Sephadex 
column preequilibrated with 0.01 M phosphate buffer at pH 

15 7.2. The column was washed with the same buffer for 3 hr. 
The product single chain monellin analog was eluted with 
0.01 M phosphate buffer plus 0.1 M NaCl. After dialysis . 
against water, the purity <3f the protein was determined by 
gel electrophoresis. If necessary, the protein was 

2 0 further purified by affinity column of Sepharose CI-4B 
charged with polyclonal monellin antibodies. 

Example 5 
Preparation of Alternate B-OAs 
25 Following the above procedures, modified DNA 

sequences were prepared having different "G" linking 
sequences (between nucleotides 141 and 166 of Figure 3) as 
follows. 

30 Tyr-Glu-Asn-Arg-Glu-Asp-Ile-Lys 
Tyr-Ala-Ser-Asp-Lys r and 
Glu-Asp-Tyr-Lys-Thr-Arg- 
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The codons employed were the same as the codons indicated 
for the construct shown in Figure 4 . 
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Example 6 
Test for Sweetness 
Sweetness of the product was assessed using an 
ordinary taste test. Comparison to the sweetness of 
5 sucrose was made by suitable dilutions on a weight basis . 
Compounds of the invention have at least the same sweet- 
ness as sucrose when diluted about 1000-fold in 
comparison. 

In a typical test, 1, 10, 25 and 50 mg/ml 

10 aqueous sugar solution were used as standard solutions* 
The minimum weight of test compound which can be added to 
the water in order for it still to taste sweet to the 
tester is compared with the corresponding minimum weight 
of sucrose. The compounds of the invention require the 

15 addition of only an amount of material which is about 100- 
fold less than sucrose, for example, 60 ug/ml protein 
solution was as sweet as 50 mg/ml sugar (Lucky 
Supermarket's Lady Lee brand sugar). 

Stability under different pH conditions was 

20 measured by dissolving natural monellin (Worthington) and 
the product of Example 4 at 100 ug/ml concentration at pH 
2, 4 and 6.3. Each sample was heated to 40, 50, 60, 70, 
80, 90 and 100°C for 15 min, then let cool to room 
temperature before tasting. The most dramatic difference 

25 was that natural monellin lost its sweetness when heated 
to 50°C at pH 2 then let cool to room temperature for a 
taste test-, while the product regained sweetness even 
after heating at 100 P C for 5 min. 

30 
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Claims. 

1. A single chain proteinaceous compound hav- 
ing a sweetness property at least 50 times that of sucrose 
5 on a weight basis, of the formula B-OA, 

wherein B represents a peptide portion 
substantially equivalent to residues 1-46 of the B chain 
of native monellin; 

C is a hydrophilic, physiologically acceptable 
10 covalent linker capable of providing a spacing length 
equivalent to a peptide of 3-10 amino acids; and 

A represents a peptide substantially 
equivalent to residues 6-45 of the A chain of native 



15 



monellin. 



2. The compound of claim 1 wherein the B 
chain sequence is 

(Met) Gly Glu Trp Glu He He Asp He Gly 
Pro Phe Thr Gin Asn Leu Gly Lys Phe Ala 
20 Val Asp Glu Glu Asn Lys He Gly Gin Tyr 

Gly Arg Leu Thr Phe Asn Lys Val He Arg 
Pro Cys Met Lys Lys Thr lie. 

3. The compound of claim 1 wherein the A 
25 chain sequence is 

Gly Tyr Glu Tyr Gin 

Leu Tyr Val Tyr Ala Ser Asp Lys Leu Phe 

Arg Ala Asp lie Ser Glu Asp Tyr Lys Thr 

Arg Gly Arg Lys Leu Leu Arg Phe Asn Gly 
30 Pro Val Pro Pro Pro 

4 . The compound of claim 1 wherein C is a 
hydrophilic oligomer containing 9-30 atoms in the linking 
segment . 
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5. The compound of claim 4 wherein C has the 

formula 

Al - A2 - A3 - A4 -A5 -A6 - A7 -A8 - A9 -Al 0 
wherein each of A1-A3 represents an amino acid residue and 
5 each of A4-A10 represents an amino acid residue or is not 
present, and wherein a majority of the amino acid residues 
are hydrophilic. 

6. The compound of claim 5 wherein the amino 
10 acid residues are selected from those encoded by DNA. 

7* The compound of claim 5 wherein A2, A4 and 
A6 are acidic amino acids; A5 and A8 are basic amino 
acids; A3 is a polar neutral amino acid; and Al and A7 are 
15 nonpolar amino acids , and A9 and A10 are absent . 

8. The compound of claim 7 wherein C has the 
formula Tyr-Glu-Asn-Glu-Arg-Glu-Ile-Lys . 

20 9 . The compound of claim 1 wherein the 

covalent linker conforms the B and A chains of monellin 
to substantially the native conformation, 

10. The compound of claim 1 which is 

25 immunoreactive with antibodies raised against monellin. 

11. The compound of claim 1 which is 
immunoreactive with antibodies raised against thaumatiriy 

30 12 . A method to provide a sweet flavor to 

food compositions which comprises including a sweetening 
enhancing amount of the compound of claim 1 in said 
compositions . 
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13. A method to stabilize the three 
dimensional conformation of the B and A chains of monellin 
which comprises covalently binding a sequence 
substantially equivalent to residues 1-4 6 of the native B 
5 to a sequence substantially equivalent to residues 6-50 of 
the native A chain through a covalent linker which is 
hydrophilic, physiologically acceptable and equivalent in 
length to a 3-10 amino acid peptide, 

10 14. The method of claim 13 wherein the 

covalent linker is a peptide. 

15. A synthetic or recombinant DNA which 
encodes the compound of claim 6. 

15 

16 * An expression system effective in 
expressing the DNA of claim 15 which comprises a first DNA 
encoding a compound of the formula B-C-A having a sweet- 
ness property at least 50 times that of sucrose on a 
20 weight basis, 

wherein B represents a peptide portion 
substantially equivalent to residues 1-4 6 of the B chain 
of native monellin; 

C is a peptide of 3-10 amino acids encoded by 

25 DNA, and, 

A represents a peptide substantially 
equivalent to residues 6-45 of the A chain of native 
: monellin, . : . V ••'V^Wv 

operably linked to a second DNA sequence 
30 comprising a promoter functional in a compatible host. 

17. A cell capable of producing a compound 
having a sweetness property at least 50 times that of 
sucrose on a weight basis, of the formula B-OA, 

35 



WO 88/10265 



-22- 



PCT/US33/01825 



wherein B represents a peptide portion 
substantially equivalent to residues 1-45 of the B chain 
of native monellin; 

C is a peptide of 3-10 amino acids encoded by 

5 DNA; and 

A represents a peptide substantially 
equivalent to residues 6-45 of the A chain of* native 
monellin 

which has been transformed with the expression 
10 system of claim 16. 



18. A method to produce a compound having a 
sweetness property at least 50 times that of sucrose on a 
weight basis r of the formula B-C-A, 
15 wherein B represents a peptide portion 

substantially equivalent to residues 1-46 of the B chain 
of native monellin; 

C is a peptide of 3-10 amino acids encoded by 

DNA; and 

20 A represents a peptide substantially 

equivalent to residues 6-45 of the A chain of native 
monellin, which comprises: 

culturing the cells of claim 17, 
exposing said cells to conditions favoring the 
25 functioning of the expression system, and 

recovering the compounds from the culture . 

19y A method to produce a compound having a 
sweetness property at least 50 times that of sucrose on a 
30 weight basis, of the formula B-C-A, 

wherein B represents a peptide portion 
substantially equivalent to residues 1-46 of the B chain 
of native monellin; 

C is a hydrophilic physiologically acceptable 
35 covalent linker capable of providing a spacing length 
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equivalent to a peptide of 3-10 amino acids encoded by 
DNA; and 

A represents a peptide substantially 
equivalent to residues 6-45 of the A chain of native 
5 monellin r which comprises: 

a) preparing protected peptides bonded to 
solid supports wherein said one peptide has an amino acid 
sequence as recited for B and another peptide as recited 
for A, 

10 b) removing the solid support from the 

peptides and deprotecting the peptides; and 

c) ligating the peptides B and A to the 
covalent linker C to obtain B-C-A. 
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AMINO ACID SEQUENCE OF NATURAL MONELLIN 
A-Chain 

*PHE ARG GLU ILE LYS GLY TYR GLU TYR GLN 
LEU TYR VAL TYR ALA SER ASP LYS LEU PHE 
ARG ALA ASP ILE SER GLU ASP TYR LYS THR FIG. 1 
ARG GLY ARG LYS LEU LEU ARG PHE ASN GLY 
PRO VAL PRO PRO PRO 

45 

B-Chain 
1 

GLY GLU TRP GLU ILE ILE ASP ILE GLY PRO 
PHE THR GLN ASN LEU GLY LYS PHE ALA VAL 
ASP GLU GLU ASN LYS ILE GLY GLN TYR GLY 
ARG LEU THR PHE ASN LYS VAL ILE ARG PRO 
CYS MET LYS LYS THR ILE TYR GLU ASN GLU 

50 

* PHE presents only in10 % of natural monellin 
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